104 results found.
Language Type:
Multilingual
Languages:
Hindi
Availability:
From Owner
License:
CreativeCommons
Size:
480 MByte Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Hindi
Availability:
From Owner
License:
<Not Specified>
Size:
16Mbyte Production Status:
Newly created-in progress
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Some documentation is available, but much more is planned. All of it will be in English and will be publicly available.Language Type:
Multilingual
Languages:
Hindi
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
200,000 sentence-pairs Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Hindi
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
Bangali Hindi Telugu
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
Transliteration test bench
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Monolingual
Languages:
Hindi
Availability:
From Owner
License:
Size:
13.6 MByte Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:A Platform for Event Extraction in Hindi
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sovan Kumar Sahoo | Hindi_Event | /N |
Documentation:
NO
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German Hindi Italian Persian
Availability:
Freely Available
License:
Creative Commons - Attribution-{NonCommercial}-{ShareAlike} 4.0 International ({CC} {BY}-{NC}-{SA} 4.0)
Size:
162M sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:LSCP: Enhanced Large Scale Colloquial Persian Language Understanding
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mahdi Bohlouli | Large-Scale Colloquial Persian 0.5 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Hindi
Availability:
Freely Available
License:
MIT
Size:
1 MByte Production Status:
Newly created-finished
Use:
Discourse
-
Paper title:An Annotated Dataset of Discourse Modes in Hindi Stories
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Debanjan Mahata | Hindi-Discourse-Modes | /N |
Documentation:
https://github.com/midas-research/hindi-discourse
Written
Corpus,
Language Type:
Multilingual
Languages:
Bengali Gujarati Hindi Kannada Malayalam Marathi Punjabi Sindhi Sinhala Tamil Telugu Urdu
Availability:
Freely Available
License:
CreativeCommons
Size:
2 GByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Brian Roark | Dakshina dataset | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org




